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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Greene, Marianne 
Blumberg, Bruce 

(ii) TITLE OF INVENTION: Human/Peroxisome Proliferator 

Activated Receptor Gamma: 
Compositions and Methods 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS^ 

(A) ADDRESSEE: Arnold, White & Durkee 

(B) STREET: 321 Nortyh Clark Street, Suite 800 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60610 

(v) COMPUTER READ ABLE/ FORM : 

(A) MEDIUM TYPE/ Floppy disk 

(B) COMPUTER: EBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE : /Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US UNKNOWN 

(B) FILING />ATE: 08-OCT-1993 

(C) CLASSIFICATION: 

/ 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME 'J Coughlin, Daniel F. 

(B) REGISTRATION NUMBER^36 , 111 

(C) REFERENCE / DOCKET NpM|ERj ARCD098 

(ix) TELECOMMUNICATION INFOf 

(A) TELEPHONE: 312-744- 

(B) TELEFAX: 312-755-4489 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A^ LENGTH: 1844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
i/D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) /FEATURE: 

/ (A) NAME /KEY : CDS 
/ (B) LOCATION: 179.. 1603 

(xiA SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCGACCTTAC CCCAGGCGGC CTTGACGTTG GTCTTGTCGG CAGGAGACAG CACCATGGTG 60 

GGTTCTCTCT GAGTCTGGGA ATTCCCGAGC CCGAGCCGCA GCCGCCGCCT GGGGGGCTTG 120 

GGTCGGGCTC GAGGACACCG GAGAGGGGCG CCACGCCGCC GTGGCCGCAG AAATGACC 178 
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ATG GTT 
Met Vat 
1 



GAC ACA 
Asp Thr 



GAG ATC GCA TTC 
Glu lie Ala Phe 
5 



TGG CCC 
Trp Pro 
10 



ACC AAC 
Thr Asn 



TTT GGG ATC AGC 
Phe Gly I le Ser 
15 



TCC GTG 
Ser Val 



GAT CTC 
Asp Leu 
20 



TCC GTA ATG GAA 
Ser Val Met Glu 



GAC CAC 
Asp His 
25 



TCC CAC 
Ser His 



TCC TTT GAT ATC 
Ser Phe Asp I le 
30 



AAG CCC 
Lys Pro 



TTC ACT 
Phe Thr 
35 



ACT GTT GAC TTC 
Thr Vat Asp Phe 
AO 



TCC AGC 
Ser Ser 



ATT TCT 
He Ser 



ACT CCA CAT TAC 
Thr Pro His Tyr 
45 



GAA GAC 
GLu Asp 
50 



ATT CCA 
He Pro 



TTC ACA AGA ACA 
Phe Thr Arg Thr 
55 



GAT CCA 
Asp Pro 



GTG GTT 
Val Vat 
60 



GCA GAT TAC AAG 
Ala Asp Tyr Lys 



TAT GAC 
Tyr Asp 
65 



CTG AAA 
Leu Lys 



CTT CAA GAG TAC 
Leu Gin Glu Tyr 
70 



CAA AGT 
Gin Ser 



GCA ATC 
Ala lie 
75 



AAA GTG 
Lys Val 



GAG CCT 
Gtu Pro 
80 



GCA TCT 
Ala Ser 



CCA CCT 
Pro Pro 



TAT TAT TCT GAG 
Tyr Tyr Ser Glu 
85 



AAG ACT 
Lys Thr 
90 



CAG CTC 
Gin Leu 



TAC AAT 
Tyr Asn 



AAG CCT 
Lys Pro 
95 



CAT GAA 
His Glu 



GAG CCT 
Glu Pro 
100 



TCC AAC TCC CTC 
Ser Asn Ser Leu 



ATG GCA 
Met Ala 
105 



ATT GAA 
lie Glu 



TGT CGT 
Cys Arg 
110 



GTC TGT 
Val Cys 



GGA GAT 
Gly Asp 



AAA GCT 
Lys Ala 
115 



TCT GGA TTT CAC 
Ser Gly Phe His 
120 



TAT GGA 
Tyr Gly 



GTT CAT 
Val His 



GCT TGT 
Ala Cys 
125 



GAA GGA 
Glu Gly 



TGC AAG 
Cys Lys 
130 



GGT TTC 
Gly Phe 



TTC CGG AGA ACA 
Phe Arg Arg Thr 
135 



ATC AGA 
He Arg 



TTG AAG 
Leu Lys 
HO 



CTT ATC 
Leu lie 



TAT GAC 
Tyr Asp 



AGA TGT 
Arg Cys 
145 



GAT CTT 
Asp Leu 



AAC TGT CGG ATC 
Asn Cys Arg lie 
150 



CAC AAA 
His Lys 



AAA AGT 
Lys Ser 
155 



AGA AAT AAA TGT 
Arg Asn Lys Cys 
160 



CAG TAC 
Gin Tyr 



TGT CGG 
Cys Arg 



TTT CAG AAA TGC 
Phe Gin Lys Cys 
165 



CTT GCA 
Leu Ala 
170 



GTG GGG 
Val Gly 



ATG TCT CAT AAT 
Met Ser His Asn 
175 



GCC ATC 
Ala He 



AGG TTT 
Arg Phe 
180 



GGG CGG ATC GCA 
Gly Arg He Ala 



CAG GCC 
Gin Ala 
185 



GAG AAG 
Glu Lys 



GAG AAG CTG TTG 
Glu Lys Leu Leu 
190 



GCG GAG 
Ala Glu 



ATC TCC 
He Ser 
195 



AGT GAT 
Ser Asp 



ATC GAC 
He Asp 
200 



CAG CTG 
Gin Leu 



AAT CCA 
Asn Pro 



GAG TCC GCT GAC 
Gtu Ser Ala Asp 
205 



CTC CGT 
Leu Arg 
210 



CAG GCC 
Gin Ala 



CTG GCA 
Leu Ala 



AAA CAT 

Lys His 
215 



TTG TAT 
Leu Tyr 



GAC TCA 
Asp Ser 
220 



TAC ATA AAG TCC 
Tyr I le Lys Ser 



TTC CCG 
Phe Pro 
225 



CTG ACC 
Leu Thr 



AAA GCA 
Lys Ala 
230 



AAG GCG 
Lys Ala 



AGG GCG 
Arg Ala 



ATC TTG 
He Leu 
235 



ACA GGA AAG ACA 
Thr Gly Lys Thr 
240 



ACA GAC 
Thr Asp 



AAA TCA 
Lys Ser 



CCA TTC 
Pro Phe 
245 



GTT ATC 
Val He 



TAT GAC 
Tyr Asp 
250 



ATG AAT 
Met Asn 



TCC TTA ATG ATG 
Ser Leu Met Met 
255 



GGA GAA 
Gly Glu 



GAT AAA 
Asp Lys 
260 



ATC AAG 
He Lys 



TTC AAA 
Phe Lys 



CAC ATC 
His He 
265 



ACC CCC 
Thr Pro 



CTG CAG GAG CAG 
Leu Gin Glu Gin 
270 



AGC AAA 
Ser Lys 



GAG GTG 
Gtu Val 
275 



GCC ATC CGC ATC 
Ala He Arg He 
280 



TTT CAG 
Phe Gin 



GGC TGC 
Gly Cys 



CAG TTT CGC TCC 
Gin Phe Arg Ser 
285 



4 



9H 



35 



45 
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GTG GAG GCT GTG CAG GAG ATC ACA GAG TAT GCC AAA AGC ATT CCT GGT 1090 
Val Glu Ala Val Gin Glu He Thr Glu Tyr Ala Lys Ser He Pro Gly 
290 295 300 

TTT GTA AAT CTT GAC TTG AAC GAC CAA GTA ACT CTC CTC AAA TAT GGA 1138 
Phe Val Asn Leu Asp Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly 
305 310 315 320 

GTC CAC GAG ATC ATT TAC ACA ATG CTG GCC TCC TTG ATG AAT AAA GAT 1186 
Val His Glu lie lie Tyr Thr Met Leu Ala Ser Leu Met Asn Lys Asp 
325 330 335 



10 GGG GTT CTC ATA TCC GAG GGC CAA GGC TTC ATG ACA AGG GAG TTT CTA 

Gly Val Leu He Ser Glu Gly Gin Gly Phe Met Thr Arg Glu Phe Leu 
340 345 350 



1£ - Lys Ser Leu Arg Lys Pro Phe Gly Asp Phe Met Glu Pro Lys Phe Glu 

15 355 



25 GAG CTC CAG CTG AAG CTG AAC CAC CCT GAG TCC TCA CAG CTG TTT GCC 

Glu Leu Gin Leu Lys Leu Asn His Pro Glu Ser Ser Gin Leu Phe Ala 
420 425 430 



ori Lys Leu Leu Gin Lys Met Thr Asp Leu Arg Gin He Val Thr Glu His 

30 435 



1234 



AAG AGC CTG CGA AAG CCT TTT GGT GAC TTT ATG GAG CCC AAG TTT GAG 1282 

Gly Asp Phe Met Glu Pro 
360 365 



TTT GCT GTG AAG TTC AAT GCA CTG GAA TTA GAT GAC AGC GAC TTG GCA 1330 
Phe Ala Val Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp Leu Ala 
370 375 3S0 

ATA TTT ATT GCT GTC ATT ATT CTC AGT GGA GAC CGC CCA GGT TTG CTG 1378 
" s *' £ «la Val lie He Leu Ser Gly Asp Arg Pro GLy Leu Leu 
385 390 395 400 

AAT GTG AAG CCC ATT GAA GAC ATT CAA GAC AAC CTG CTA CAA GCC CTG 1426 
Asn Val Lys Pro lie Glu Asp He Gin Asp Asn Leu Leu Gin Ala Leu 
405 410 415 



1474 



AAG CTG CTC CAG AAA ATG ACA GAC CTC AGA CAG ATT GTC ACG GAA CAC 1522 

Asp Leu Arg Gin lie Val 
440 445 



GTG CAG CTA CTG CAG GTG ATC AAG AAG ACG GAG ACA GAC ATG AGT CTT 1570 
Vat Gin Leu Leu Gin Vat He Lys Lys Thr Glu Thr Asp Met Ser Leu 
450 455 460 

CAC CCG CTC CTG CAG GAG ATC TAC AAG GAC TTG TAC TAGCAGAGAG 1616 
His Pro Leu Leu Gin Glu He Tyr Lys Asp Leu Tyr 
465 470 475 



TCCTGAGCCA CTGCCAACAT TTCCCTTCTT CCATTTGCAC TATTCTGAGG GAAAATCTGA 1676 

CCATAAGAAA TTTACTGTGA AAAAGCGTTT TAAAAAGAAA AGGGTTTAGA ATATGATCTA 1736 

TTTTATGCAT ATTGTTTATA AAGACACATT TACAATTTAC TTTTAATATT AAAAATTACC 1796 

40 ATATTATGAA ATTGCAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 1844 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Asp Thr Glu lie Ala Phe Trp Pro Thr Asn Phe Gly lie Ser 
15 10 is 
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Ser Val Asp Leu Ser Val Met Glu Asp His Ser His Ser Phe Asp lie 
20 25 30 

Lys Pro Phe Thr Thr Val Asp Phe Ser Ser lie Ser Thr Pro His Tyr 
35 40 45 

Glu Asp He Pro Phe Thr Arg Thr Asp Pro Val Val Ala Asp Tyr Lys 
50 55 60 

Tyr Asp Leu Lys Leu Gin Glu Tyr Gin Ser Ala He Lys Val Glu Pro 
65 70 75 80 

Ala Ser Pro Pro Tyr Tyr Ser Glu Lys Thr Gin Leu Tyr Asn Lys Pro 
85 90 95 

His Glu Glu Pro Ser Asn Ser Leu Met Ala He Glu Cys Arg Val Cye 
100 105 110 

Gly Asp Lys Ala Ser Gly Phe His Tyr Gly Val His Ala Cys Glu Gly 
115 120 125 

Cys Lys Gly Phe Phe Arg Arg Thr He Arg Leu Lys Leu lie Tyr Asp 
130 135 140 

Arg Cys Asp Leu Asn Cys Arg He His Lys Lys Ser Arg Asn Lys Cys 
145 150 155 160 

Gin Tyr Cys Arg Phe Gin Lys Cys Leu Ala Val Gly Met Ser His Asn 
165 170 175 

Ala He Arg Phe Gly Arg He Ala Gin Ala Glu Lys Glu Lys Leu Leu 
180 185 190 

Ala Glu He Ser Ser Asp He Asp Gin Leu Asn Pro Glu Ser Ala Asp 
195 200 205 

Leu Arg Gin Ala Leu Ala Lys His Leu Tyr Asp Ser Tyr He Lys Ser 
210 215 220 

Phe Pro Leu Thr Lys Ala Lys Ala Arg Ala lie Leu Thr Gly Lys Thr 
225 230 235 240 

Thr Asp Lys Ser Pro Phe Val lie Tyr Asp Met Asn Ser Leu Met Met 
245 250 255 

Gly Glu Asp Lys He Lys Phe Lys His lie Thr Pro Leu Gin Glu Gin 
260 265 270 

Ser Lys Glu Val Ala lie Arg lie Phe Gin Gly Cys Gin Phe Arg Ser 
275 280 285 

Val Glu Ala Val Gin Glu lie Thr Glu Tyr Ala Lys Ser He Pro Gly 
290 295 300 

Phe Val Asn Leu Asp Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly 
305 310 315 320 

Val His Glu lie lie Tyr Thr Met Leu Ala Ser Leu Met Asn Lys Asp 
325 330 335 

Gly Val Leu lie Ser Glu Gly Gin Gly Phe Met Thr Arg Glu Phe Leu 
340 345 350 

Lys Ser Leu Arg Lys Pro Phe Gly Asp Phe Met Glu Pro Lys Phe Glu 
355 360 365 
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Phe Ala Val Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp Leu Ala 
370 375 380 

lie Phe lie Ala Val lie lie Leu Ser Gly Asp Arg Pro Gly Leu Leu 
385 390 395 400 

Asn Val Lys Pro lie Glu Asp lie Gin Asp Asn Leu Leu Gin Ala Leu 
405 410 415 

Glu Leu Gin Leu Lys Leu Asn His Pro Glu Ser Ser Gin Leu Phe Ala 
420 425 430 

Lys Leu Leu Gin Lys Met Thr Asp Leu Arg Gin lie Val Thr Glu His 
435 440 445 

Val Gin Leu Leu Gin Val lie Lys Lys Thr Glu Thr Asp Met Ser Leu 
450 455 460 

His Pro Leu Leu Gin Glu lie Tyr Lys Asp Leu Tyr 
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACTAGTGCGG CCGCCTAGGC CTCGATTTT TTTTTTTTTT T 41 



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix) FEATURE: 

(A) NAME /KEY : Y 

(B) LOCATION: 3, 12 and 21 

(C) IDENTIFICATION METHOD: Y 

( ix) FEATURE: 

(A) NAME /KEY : R 

(B) LOCATION: 6, and 15 

(C) IDENTIFICATION METHOD: R 



= C or T 



= A or G 



(ix) FEATURE: 

(A) NAME/KEY: N 

(B) LOCATION: 9 and 18 

(C) IDENTIFICATION METHOD: 



N = G, A, C or T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



-J 
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TGYGARGGNT GYAARGGNTT YTT 
23 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Cys Glu Gly Cys Lys Gly Phe Phe 

1 . 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 19 base pairs 

(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACGTGACCTT TGTCCTGGT 19 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
30 TGACCT 6 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE : amino acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Thr lie Arg Leu Lys 
40 1 5 10 15 



